Sound control apparatus, program, and control method

ABSTRACT

An apparatus may include a hardware processor and a storage medium. The storage medium may be coupled to the processor, and may store instructions. When executed by the processor, the instructions may cause the apparatus to receive a direction signal indicating a first direction in which a head of a user is facing with respect to a fixed direction. The instructions may also cause the apparatus to convert the first direction in which the head is facing with respect to the fixed direction into a second direction in which the head is facing with respect to a third direction, connecting the head and an object. Additionally, the instructions may cause the apparatus to generate an audio signal having characteristics such that a sound generated from the audio signal is perceived by the user as arriving from the third direction.

BACKGROUND

The present disclosure contains subject matter related to that disclosed in Japanese Priority Patent Application JP 2011-128983, filed in the Japan Patent Office on Jun. 9, 2011, the entire content of which is hereby incorporated by reference.

The present disclosure relates to a technique of a sound control apparatus that controls sounds from headphones, earphones, and the like.

A technique of controlling sound signals of headphones and earphones such that sounds are heard from a certain direction has been known for some time.

Japanese Patent Application Laid-open No. 2008-92193 discloses a technique of arranging a plurality of virtual sound sources related to music in virtual sound source space and controlling sound signals from headphones such that the music is heard from the plurality of virtual sound sources. For example, assuming that a user wearing headphones faces rightwardly from a state where he/she is facing front, music that has been heard from the front direction when the user is facing front is heard from the left-hand direction, and music that has been heard from the right-hand direction when the user is facing front is heard from the front direction.

SUMMARY

There is a need for a technique for presenting simple information on a real object (or virtual object) such as a shop to a user from a direction in which the real object (or virtual object) exists, and for obtaining more-specific information as the user carries out some kind of action with respect to the presented information.

Accordingly, there is disclosed an apparatus, which may include a hardware processor and a storage medium. The storage medium may be coupled to the processor, and may store instructions. When executed by the processor, the instructions may cause the apparatus to receive a direction signal indicating a first direction in which a head of a user is facing with respect to a fixed direction. The instructions may also cause the apparatus to convert the first direction in which the head is facing with respect to the fixed direction into a second direction in which the head is facing with respect to a third direction, connecting the head and an object. Additionally, the instructions may cause the apparatus to generate an audio signal having characteristics such that a sound generated from the audio signal is perceived by the user as arriving from the third direction.

There is also disclosed a method. The method may include receiving a direction signal indicating a first direction in which a head of a user is facing with respect to a fixed direction. The method may also include converting the first direction in which the head is facing with respect to the fixed direction into a second direction in which the head is facing with respect to a third direction, connecting the head and an object. Additionally, the method may include generating an audio signal having characteristics such that a sound generated from the audio signal is perceived by the user as arriving from the third direction.

Additionally, there is disclosed a non-transitory, computer-readable storage medium storing instructions. When executed by a processor, the instructions may cause an apparatus to receive a direction signal indicating a first direction in which a head of a user is facing with respect to a fixed direction. The instructions may also cause the apparatus to convert the first direction in which the head is facing with respect to the fixed direction into a second direction in which the head is facing with respect to a third direction, connecting the head and an object. Additionally, the instructions may cause the apparatus to generate an audio signal having characteristics such that a sound generated from the audio signal is perceived by the user as arriving from the third direction.

According to an embodiment, there is provided a sound control apparatus including a sensor unit and a controller.

The controller is configured to judge a direction of a face of a user with respect to one of a real object present in space and a virtual object virtually arranged in space based on a signal from the sensor unit, control an output of a sound signal such that primary information on one of the real object and the virtual object is heard from a sound output unit from a direction corresponding to the direction of the face with respect to one of the real object and the virtual object, judge an action of the user with respect to the primary information based on the signal from the sensor unit, and control the output of the sound signal such that secondary information as a content more specific than the primary information is heard from the sound output unit according to the action.

In the sound control apparatus, sound signals of the sound output unit (e.g., headphones or earphones) are controlled such that primary information on a real object such as a shop and a building (e.g., “there is a shop”) is heard from a direction in which the real object exists. Alternatively, assuming that a virtual object such as a mail program is virtually arranged in space, for example, sound signals of the sound output unit are controlled such that primary information on the virtual object (e.g., “mail received”) is heard from a direction in which the virtual object is arranged.

When the user carries out an action with respect to the primary information, the action is detected by the sensor unit, and sound signals of the sound output unit are controlled such that secondary information as a content more specific than the primary information (e.g., specific information on a shop, or a mail content) is heard. In other words, by carrying out an action with respect to the primary information, the user can hear secondary information that is more specific than the primary information.

The sound control apparatus may further include a communication unit. In this case, the controller may measure a position of the sound control apparatus, acquire positional information of the real object present in space via the communication unit, judge a distance between the sound control apparatus and the real object based on the positional information, and control the output of the sound signal such that the primary information on the real object is heard from the sound output unit when the distance becomes equal to or smaller than a predetermined threshold value (i.e., distance).

With this structure, when the distance between the sound control apparatus and the real object becomes equal to or smaller than a predetermined threshold value at a time the user is carrying the sound control apparatus, sound signals of the sound output unit are controlled such that the primary information on the real object is heard from the direction in which the real object exists. If the user wishes for secondary information (specific information) on the real object, the user only needs to carry out an action with respect to the primary information.
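For illustration only, the threshold check described above might be sketched as follows in Python (a minimal sketch; the function names and the 500 m threshold are hypothetical, and the great-circle formula is a standard approximation rather than anything specified by this disclosure):

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean Earth radius, meters

def distance_m(lat1, lon1, lat2, lon2):
    """Great-circle (haversine) distance between two GPS fixes, in meters."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = math.radians(lat2 - lat1)
    dl = math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))

def should_present_primary_info(user_pos, object_pos, threshold_m=500.0):
    """Return True when the real object lies within the predetermined threshold distance."""
    return distance_m(*user_pos, *object_pos) <= threshold_m

# Example: compare a user position against the position of a nearby shop
print(should_present_primary_info((35.6595, 139.7005), (35.6586, 139.7454)))
```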

In the sound control apparatus, the controller may judge, as the action of the user with respect to the primary information, a movement of directing the face in the direction from which the primary information is heard, and control the output of the sound signal such that a volume of the secondary information becomes larger as the direction of the face approaches the direction from which the primary information is heard.

In the sound control apparatus, as the user faces the direction from which the primary information is heard when wishing for the secondary information, the volume of the secondary information becomes larger as the direction of the face approaches the direction from which the primary information is heard.

In the sound control apparatus, the controller may execute a main task and control the output of the sound signal such that the volume of the secondary information becomes larger and a volume of the main task becomes smaller as the direction of the face approaches the direction from which the primary information is heard.

In the sound control apparatus, as the user faces the direction from which the primary information is heard when wishing for the secondary information, the volume of the secondary information becomes larger, and, for example, a volume of music being reproduced by the main task becomes smaller.
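A minimal sketch of such a crossfade, assuming a simple linear gain mapping (the function names, the 90° window, and the residual main-task gain are illustrative assumptions, not part of the disclosure):

```python
def crossfade_gains(face_azimuth_deg, source_azimuth_deg, window_deg=90.0):
    """Return (secondary_gain, main_task_gain), each in [0, 1].

    The closer the face direction gets to the direction of the sound source of the
    secondary information, the louder the secondary information and the quieter
    the main task (e.g., music) become.
    """
    # Smallest absolute angle between the face direction and the source direction
    diff = abs((face_azimuth_deg - source_azimuth_deg + 180.0) % 360.0 - 180.0)
    closeness = max(0.0, 1.0 - diff / window_deg)   # 1.0 when facing the source
    secondary_gain = closeness
    main_task_gain = 1.0 - 0.8 * closeness          # keep the main task faintly audible
    return secondary_gain, main_task_gain

print(crossfade_gains(80.0, 90.0))   # almost facing the source: secondary loud, music quiet
print(crossfade_gains(-90.0, 90.0))  # facing away: secondary silent, music at full volume
```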

In the sound control apparatus, the controller may judge a movement of a head as the action of the user with respect to the primary information and control the output of the sound signal such that the secondary information is heard from the sound output unit when the head is moved up and down.

As a result, the user can hear the secondary information by moving the head up and down (nodding).

In the sound control apparatus, the controller may judge, as the action of the user with respect to the primary information, a movement of a head in a state where the user is directing the face toward the direction from which the primary information is heard, and judge whether to output the secondary information as the sound signal based on the movement of the head in the state where the face is directed toward the direction from which the primary information is heard.

With this structure, the user can select whether to hear the secondary information by merely moving the head while facing the direction from which the primary information is heard.

In the sound control apparatus, the controller may judge a movement of the head in the state where the user is directing the face toward the direction from which the primary information is heard, and control the output of the sound signal such that the secondary information is heard from the sound output unit based on the movement of the head in the state where the face is directed toward the direction from which the primary information is heard.

With this structure, the user can hear the secondary information by merely moving the head up and down (nodding) while facing the direction from which the primary information is heard.

In the sound control apparatus, the controller may control the output of the sound signal such that the secondary information is heard from a direction corresponding to the direction of the face with respect to one of the real object and the virtual object.

In the sound control apparatus, the sound output unit may be one of headphones and earphones.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram showing a sound control apparatus according to an embodiment of the present disclosure;

FIG. 2 is a block diagram showing an electrical structure of the sound control apparatus according to the embodiment of the present disclosure;

FIG. 3 is a flowchart showing processing of a controller of a cellular (i.e., mobile) phone according to the embodiment of the present disclosure;

FIG. 4 is a complementary diagram for explaining the processing shown in FIG. 3, the diagram showing a state where a user is wearing headphones and walking on the ground;

FIG. 5 is a diagram showing an example of a setting screen displayed for the user to set which information on a real object belonging to what category is to be presented;

FIG. 6 is a flowchart showing an example of processing of the controller that is carried out when secondary information is heard from headphones in accordance with a user action with respect to primary information;

FIG. 7 is a flowchart showing another example of the processing of the controller that is carried out when secondary information is heard from headphones in accordance with a user action with respect to primary information;

FIG. 8 is a flowchart showing still another example of the processing of the controller that is carried out when secondary information is heard from headphones in accordance with a user action with respect to primary information;

FIG. 9 is a diagram showing a sound control apparatus according to another embodiment of the present disclosure;

FIG. 10 is a block diagram showing an electrical structure of the sound control apparatus according to the another embodiment;

FIG. 11 is a flowchart showing processing of a controller of a display-unit-integrated PC according to the another embodiment;

FIG. 12 is a complementary diagram for explaining the processing shown in FIG. 11, the diagram showing virtual objects virtually arranged in space;

FIG. 13 is a diagram showing a state where a user faces, after listening to primary information on a mail program, a direction from which the primary information is heard;

FIG. 14 is a flowchart showing processing of a sound control apparatus according to still another embodiment of the present disclosure; and

FIG. 15 is a complementary diagram for explaining the processing shown in FIG. 14, the diagram showing a state where the user utters a voice while facing a direction in which a virtual object (mail program) exists.

DETAILED DESCRIPTION OF EMBODIMENTS

Hereinafter, embodiments of the present disclosure will be described with reference to the drawings.

[Overall Structure of Sound Control Apparatus 100 and Structures of Components]

FIG. 1 is a diagram showing a sound control apparatus 100 according to an embodiment of the present disclosure. FIG. 2 is a block diagram showing an electrical structure of the sound control apparatus 100. As shown in the figures, the sound control apparatus 100 includes a sound control apparatus body 10 (cellular phone 10), headphones 20 (sound output unit) connected to the sound control apparatus body 10, and a sensor unit 30 attached to the headphones 20. In the first embodiment, the cellular phone 10 is taken as an example of the sound control apparatus body 10.

The cellular phone 10 includes a controller 11, a display unit 12, an input unit 13, an antenna 14, a communication unit 15, and a storage 16. The cellular phone 10 also includes a communication speaker and a communication microphone (not shown).

The display unit 12 is constituted of, for example, a liquid crystal display or an EL (Electro-Luminescence) display. The input unit 13 includes a touch sensor that detects a user operation made by a finger, a stylus pen, and the like with respect to the display unit 12, and an input button provided on the cellular phone 10.

The storage 16 includes a volatile memory (e.g., RAM (Random Access Memory)) and a nonvolatile memory (e.g., ROM (Read Only Memory)). The volatile memory is used as a working area of the controller 11 and temporarily stores programs used for processing of the controller 11 and data such as music data and video data. The nonvolatile memory fixedly stores various programs requisite for processing of the controller 11 and data such as music data and video data. The programs stored in the nonvolatile memory may be read out from a removable recording medium such as an optical disc and a semiconductor memory.

The controller 11 is constituted of a CPU (Central Processing Unit (i.e., hardware processor)) and the like. The controller 11 executes various operations based on the programs (i.e., instructions) stored in the storage (i.e., storage medium) 16.

The communication unit 15 executes processing of converting a frequency of radio waves transmitted and received by the antenna 14, modulation processing, demodulation processing, and the like. The antenna 14 transmits and receives communication radio waves and radio waves for packet communication for emails, web data, and the like.

The communication unit 15 is capable of communicating with an information management server (not shown). The information management server stores positional information of a real object 1 (e.g., shop and building) (see FIG. 4) present in space and primary and secondary information on the real object 1. The information management server also transmits, in response to requests from the cellular phone 10, the positional information of the real object 1 and the primary and secondary information on the real object 1.

The primary information on the real object 1 (e.g., shop and building) is simple information on the real object 1. On the other hand, the secondary information on the real object 1 is specific information on the real object 1. For example, the primary information is a shop name, a shop type, and the like in a case where the real object 1 is a shop, and is a building name, a building type, and the like in a case where the real object 1 is a building. For example, the secondary information is a recommended product, opening hours, and the like in the case where the real object 1 is a shop, and is a specific history and the like of a building in the case where the real object 1 is a building.

The sensor unit 30 is provided at, for example, a top portion of the headphones 20. The sensor unit 30 includes a direction sensor (e.g., geomagnetic sensor) that detects a direction of a face of a user (i.e., a direction in which a head of the user is facing) on earth (i.e., with respect to a fixed direction) and a motion (i.e., movement) sensor that detects a movement of the head of the user. Examples of the motion sensor include an angular velocity sensor (e.g., vibration gyro sensor and rotation top gyro sensor) and an acceleration sensor (e.g., piezoresistive sensor, piezoelectric sensor, and capacitance sensor). A combination of the angular velocity sensor and the acceleration sensor may be used as the motion sensor.

The sensor unit 30 includes an A/D conversion unit. Signals output from the direction sensor and the motion sensor (e.g., direction and movement signals, respectively) are A/D-converted and input to the controller 11 of the cellular phone 10.

Although the cellular phone 10 and the headphones 20 are connected by wires in FIGS. 1 and 2, the cellular phone 10 and the headphones 20 may communicate wirelessly.

[Descriptions on Operations]

Next, processing of the controller 11 of the cellular phone 10 of this embodiment will be described. FIG. 3 is a flowchart showing the processing of the cellular phone 10 (controller 11) of this embodiment. FIG. 4 is a complementary diagram for explaining the processing shown in FIG. 3, the diagram showing a state where a user is wearing the headphones 20 and walking on the ground.

First, the user walks on the ground while wearing the headphones 20. In this case, the user may walk while listening to sounds of music data or video data stored in the cellular phone 10.

As shown in FIG. 3, the controller 11 first receives radio waves (i.e., head position signals) from a GPS satellite via the antenna 14 and the communication unit 15 and measures a position of the user (position of the cellular phone 10) based on the radio waves from the GPS satellite (Step 101). The measurement of the position of the user (position of the cellular phone 10) is executed at a predetermined cycle.

Next, the controller 11 transmits, to the information management server, a request signal for positional information, primary information, and secondary information on a real object 1 (e.g., shop and building) present within a predetermined distance d from the position of the user (Step 102). The distance d is, for example, about 10 m to 2 km (see FIG. 4).
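As a rough illustration, Steps 101 and 102 could be organized as a periodic polling loop like the sketch below; the server URL, query parameters, response format, and 10-second cycle are purely hypothetical, since the disclosure does not specify a protocol between the cellular phone 10 and the information management server:

```python
import time
import requests  # third-party HTTP client, used here only for illustration

SERVER_URL = "https://example.com/objects"  # hypothetical information management server

def read_gps_fix():
    """Placeholder for the GPS measurement of Step 101 (returns latitude, longitude)."""
    return 35.6595, 139.7005

def request_nearby_objects(distance_m=1000):
    """Step 102: ask the server for real objects within distance d of the user."""
    lat, lon = read_gps_fix()
    response = requests.get(
        SERVER_URL,
        params={"lat": lat, "lon": lon, "radius_m": distance_m},
        timeout=5,
    )
    response.raise_for_status()
    # Assumed response: a list of {"name", "lat", "lon", "primary", "secondary"} entries
    return response.json()

if __name__ == "__main__":
    while True:
        objects = request_nearby_objects()
        print(f"received {len(objects)} nearby objects")
        time.sleep(10)  # predetermined measurement cycle
```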

In response to the request signal, the information management server transmits the positional information (i.e., an object position signal), primary information, and secondary information on the real object 1 present within the predetermined distance d from the position of the user.

Upon receiving the information on the real object 1 (YES in Step 103), the controller 11 judges whether there exists a real object 1 that has information within the predetermined distance d from the position of the cellular phone 10 (position of the user) (see FIG. 5) (Step 104).

When there is a real object 1 within the predetermined distance d from the position of the cellular phone 10 (YES in Step 104), the controller 11 judges whether the real object 1 matches a preference of the user (Step 105).

Here, what kind of information on the real object 1 the user wishes for is set in advance. For example, the controller 11 displays on the display unit 12 a setting screen for setting which information on a real object 1 belonging to what category is to be presented out of the real objects 1 categorized into a plurality of categories.

FIG. 5 shows an example of the setting screen displayed for the user to set which information on a real object 1 belonging to what category is to be presented. In FIG. 5, the real objects 1 are categorized into a category of “shop” and a category of “building”. In addition, in FIG. 5, the category of “shop” is categorized into categories of “restaurant”, “CD shop”, and “bookstore”, and the category of “building” is categorized into “modern architecture” and “historical architecture”.

FIG. 5 shows an example of a case where a CD shop and a modern architecture are selected as the user preference. The real object 1 selected based on the user preference is stored in the storage. When the user wishes to change the setting, “o” and “x” are inverted by a touch operation at portions where “o” and “x” are displayed.

Referring back to FIG. 3, when the real object 1 present within the predetermined distance d from the position of the user (position of the cellular phone 10) is a real object 1 that matches the user preference (YES in Step 105), the controller 11 advances to Step 106. In Step 106, the controller 11 calculates a distance between the user (cellular phone 10) and the real object 1 and an angle of a direction of a face of the user with respect to the real object 1 (i.e., a direction in which the head is facing with respect to another direction, connecting the head and the real object 1).

The distance between the user and the real object 1 is calculated based on the positional information of the user and the positional information of the real object 1. On the other hand, the angle of the direction of the face of the user is calculated based on the positional information of the cellular phone 10, the positional information of the real object 1, and information on a direction obtained by the direction sensor.
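For example, the angle of Step 106 might be computed as the difference between the compass bearing from the user to the real object 1 and the compass heading reported by the direction sensor. The following is a simplified flat-earth sketch with hypothetical names; the disclosure does not prescribe a particular formula:

```python
import math

def bearing_deg(user_lat, user_lon, obj_lat, obj_lon):
    """Approximate compass bearing (0 deg = north, clockwise) from the user to the object."""
    dlat = obj_lat - user_lat
    dlon = (obj_lon - user_lon) * math.cos(math.radians(user_lat))
    return math.degrees(math.atan2(dlon, dlat)) % 360.0

def face_angle_to_object(face_heading_deg, user_pos, obj_pos):
    """Signed angle of the object relative to the face direction (negative = to the left)."""
    target = bearing_deg(*user_pos, *obj_pos)
    return (target - face_heading_deg + 180.0) % 360.0 - 180.0

# User facing north (0 deg); the object lies to the east, so it is roughly 90 deg to the right.
print(face_angle_to_object(0.0, (35.0, 139.0), (35.0, 139.01)))
```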

It should be noted that when a plurality of real objects 1 that match a user preference are present within the predetermined distance d from the cellular phone 10, the distance and the face direction are calculated for each of the plurality of real objects 1.

Next, the controller 11 calculates a distance for providing a sound source of the real object 1 and a direction for providing the sound source of the real object 1 (Step 107). The distance for providing the sound source of the real object 1 is determined based on the distance between the user and the real object 1. In other words, a sound source of a real object 1 that is close to the user is provided at a position close to the user, whereas a sound source of a real object 1 that is distant from the user is provided at a position distant from the user. The direction for providing the sound source of the real object 1 is determined based on the angle of the direction of the face of the user with respect to the real object 1.

It should be noted that when a plurality of real objects 1 that match a user preference are present within the predetermined distance d from the cellular phone 10, the distance and direction of a sound source are calculated for each of the plurality of real objects 1.

After calculating the distance and direction for providing the sound source, the controller 11 controls an output of sound (i.e., audio) signals such that primary information is heard from a direction corresponding to the direction of the face with respect to the real object 1 (i.e., such that sound generated from the signals is perceived by the user as arriving from the direction of the real object 1) at a volume corresponding to the distance between the real object 1 and the user (Step 108).

As a result, primary information of a real object 1 that is present within the predetermined distance d from the user and matches the user preference is heard from the headphones 20. At this time, the primary information is heard from a direction corresponding to the direction of the face with respect to the real object 1 at a volume corresponding to the distance between the user and the real object 1 (see FIG. 4).
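One very simple way to approximate Step 108 over stereo headphones is constant-power panning combined with distance attenuation, as sketched below. Real implementations would more likely use HRTF filtering; this sketch and its names are illustrative assumptions only:

```python
import math

def render_gains(source_angle_deg, source_distance_m, reference_m=10.0):
    """Return (left_gain, right_gain) for a mono cue placed at the given angle
    (0 deg = straight ahead, +90 deg = right) and distance from the listener."""
    # Constant-power pan: equal gains straight ahead, all-right at +90, all-left at -90
    pan = max(-90.0, min(90.0, source_angle_deg)) / 90.0    # -1.0 .. +1.0
    theta = (pan + 1.0) * math.pi / 4.0                     # 0 .. pi/2
    left, right = math.cos(theta), math.sin(theta)
    # Simple inverse-distance attenuation so distant objects sound quieter
    attenuation = min(1.0, reference_m / max(source_distance_m, reference_m))
    return left * attenuation, right * attenuation

print(render_gains(0.0, 10.0))    # dead ahead and close: equal left/right gains
print(render_gains(90.0, 200.0))  # far to the right: right channel only, attenuated
```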

For example, simple information (primary information) such as “there is CD Shop A” (A: shop name) is heard from a direction corresponding to the direction of the face of the user with respect to CD Shop A at a volume corresponding to a distance between CD Shop A and the user. Moreover, simple information (primary information) such as “there is Tokyo Tower” is heard from a direction corresponding to the direction of the face of the user with respect to Tokyo Tower at a volume corresponding to a distance between Tokyo Tower and the user. As a result, the user can recognize which real objects 1, such as shops and buildings, that match the user preference are present in the area where the user is. The user can also intuitively recognize a direction and distance of each real object 1 from him-/herself.

It should be noted that when the user turns his/her head and changes the direction of the face while primary information of the real object 1 is being reproduced, the position of the sound source of the real object 1 is adjusted such that the primary information of the real object 1 is heard from a direction in which the real object 1 is present. For example, when the user faces the left-hand direction at a time primary information of the real object 1 is heard from the left-hand direction, the primary information of the real object 1 is then heard from the front direction.

When primary information is reproduced while the cellular phone 10 is executing a main task such as music reproduction and video reproduction, the controller 11 may synthesize audio of the primary information with audio of the music or video and output it. In this case, the controller 11 may control sound signals such that a volume of the music or video becomes small.

After controlling the output of sound signals with respect to the headphones 20, the controller 11 next controls the output of sound signals such that secondary information (specific information) of the real object 1 is heard from the headphones 20 based on a user action with respect to the primary information (Step 109).

<Processing of Controller 11 when Secondary Information is Heard from Headphones 20 According to User Action with Respect to Primary Information>

Next, processing of the controller 11 that is carried out when secondary information is heard from the headphones 20 according to a user action with respect to primary information will be specifically described using three examples.

FIG. 6 is a flowchart showing an example of the processing of the controller 11 that is carried out when secondary information is heard from the headphones 20 according to a user action with respect to primary information.

As shown in FIG. 6, upon controlling sound signals such that primary information of a real object 1 is heard from the headphones 20 (Step 201) (see Step 108), the controller 11 next judges whether the user is facing a direction from which the primary information is heard (Step 202). Whether the user is facing the direction from which the primary information is heard can be judged based on the information on a direction from the direction sensor.

When the user is facing the direction from which the primary information is heard (YES in Step 202), the controller 11 judges whether the user has shaken his/her head laterally (Step 204). In this case, the controller 11 judges whether the user has shaken his/her head laterally based on a signal from the motion sensor of the sensor unit 30 provided in the headphones 20.

When the user has not shaken his/her head laterally (NO in Step 204), the controller 11 judges whether the user has nodded (Step 205). In this case, the controller 11 judges whether the user has nodded based on a signal from the motion sensor of the sensor unit 30 provided in the headphones 20.
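A crude way to distinguish the two gestures from a head-mounted gyroscope is to compare the integrated rotation about the pitch axis (nod) with the rotation about the yaw axis (lateral shake), as in the following sketch; the thresholds, axis conventions, and names are assumptions for illustration, not the judgment method the disclosure defines:

```python
def classify_head_gesture(gyro_samples, dt=0.01, threshold_deg=15.0):
    """Classify a short window of gyroscope readings as 'nod', 'shake', or None.

    gyro_samples: list of (pitch_rate_dps, yaw_rate_dps) tuples, e.g. sampled at 100 Hz.
    """
    pitch_swing = sum(abs(p) for p, _ in gyro_samples) * dt   # total pitch rotation, degrees
    yaw_swing = sum(abs(y) for _, y in gyro_samples) * dt     # total yaw rotation, degrees
    if pitch_swing >= threshold_deg and pitch_swing > yaw_swing:
        return "nod"        # head moved up and down -> reproduce secondary information
    if yaw_swing >= threshold_deg and yaw_swing > pitch_swing:
        return "shake"      # head moved side to side -> cancel secondary information
    return None             # no decisive movement yet

# Example: 0.5 s of mostly-pitch rotation at 60 deg/s looks like a nod
samples = [(60.0, 2.0)] * 50
print(classify_head_gesture(samples))  # -> 'nod'
```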

When the user is not facing the direction from which the primary information is heard in Step 202 (NO in Step 202), the controller 11 advances to Step 203. Similarly, when the user is facing the direction from which the primary information is heard but is neither shaking his/her head laterally nor nodding (NO in Step 205), the controller 11 advances to Step 203.

In Step 203, the controller 11 judges whether a predetermined time has elapsed since an end of reproduction of the primary information (Step 203). The time is, for example, about 5 to 10 seconds.

When the predetermined time has not elapsed since the end of reproduction of the primary information (NO in Step 203), the controller 11 returns to Step 202 and judges whether the user is facing the direction from which the primary information is heard.

When the user faces the direction from which the primary information is heard and shakes his/her head laterally before the predetermined time elapses (YES in Step 204), the controller 11 ends the processing without reproducing the secondary information of the real object 1. As a result, the user can cancel the reproduction of the secondary information by facing the direction from which the primary information of the real object 1 is heard and shaking his/her head laterally.

For example, when the user faces, at a time primary information of “there is CD Shop A” is heard, a direction from which the primary information is heard (direction in which the CD shop is present) and shakes his/her head laterally, reproduction of secondary information (specific information) of the CD shop is canceled. Similarly, when the user faces, at a time primary information of “there is Tokyo Tower” is heard, a direction from which the primary information is heard (direction in which Tokyo Tower is present) and shakes his/her head laterally, reproduction of secondary information (specific information) of Tokyo Tower is canceled.

When the user faces the direction from which the primary information is heard and nods before the predetermined time elapses (YES in Step 205), the controller 11 controls sound signals such that secondary information of the real object 1 is heard from the headphones 20 (Step 206). In this case, the controller 11 controls the output of sound signals such that the secondary information of the real object 1 is heard from a direction corresponding to the direction of the face with respect to the real object 1 at a volume corresponding to the distance between the real object 1 and the user.

For example, when the user faces, at a time primary information of “there is CD Shop A” is heard, a direction from which the primary information is heard (direction in which the CD shop is present) and nods, reproduction of secondary information (specific information) of the CD shop is started. In this case, specific information such as information on a recommended CD and opening hours of CD Shop A is reproduced as the secondary information.

Similarly, when the user faces, at a time primary information of “there is Tokyo Tower” is heard, a direction from which the primary information is heard (direction in which Tokyo Tower is present) and nods, secondary information (specific information) of Tokyo Tower is reproduced. In this case, specific information such as the height of Tokyo Tower, the history of Tokyo Tower, and the hours during which Tokyo Tower is lit up is reproduced as the secondary information.

As described above, the user can grasp specific information on the real object 1 by facing the direction from which the primary information of the real object 1 is heard and nodding.

It should be noted that when the user turns his/her head and changes the direction of the face while secondary information of the real object 1 is being reproduced, the position of the sound source is adjusted such that the secondary information of the real object 1 is heard from a direction in which the real object 1 is present. For example, when the user faces the direction of the real object 1, nods, and then turns his/her head leftward by 90°, the secondary information of the real object 1 is heard from the right-hand direction.

When secondary information is reproduced while the cellular phone 10 is executing a main task such as music reproduction and video reproduction, the controller 11 may synthesize audio of the secondary information with audio of the music or video and output it. In this case, the controller 11 may control sound signals such that a volume of the music or video becomes small.

Further, reproduction of the secondary information may be stopped as the user shakes his/her head laterally while the secondary information is being reproduced.

When the user faces the direction from which the primary information is heard and a predetermined time elapses without the user shaking his/her head laterally or nodding (YES in Step 203), the controller 11 ends the processing without reproducing the secondary information. In other words, when the user does not carry out any action with respect to the primary information, the controller 11 ends the processing without reproducing the secondary information. As a result, the user can cancel the reproduction of the secondary information by not carrying out any action with respect to the primary information.

Next, processing of the controller 11 that is carried out when secondary information is heard from the headphones 20 according to a user action with respect to primary information will be described using another example.

FIG. 7 is a flowchart showing another example of the processing of the controller 11 that is carried out when secondary information is heard from the headphones 20 according to a user action with respect to primary information.

In the example of FIG. 6 described above, the user selects whether to reproduce secondary information of the real object 1 by shaking his/her head laterally or nodding while facing the direction from which the primary information of the real object 1 is heard. On the other hand, the example of FIG. 7 is different from that of FIG. 6 in that the user can face any direction when selecting whether to reproduce secondary information of a real object 1 by shaking his/her head laterally or nodding.

As shown in FIG. 7, upon controlling sound signals such that primary information of a real object 1 is heard from the headphones 20 (Step 301) (see Step 108), the controller 11 next judges whether the user has shaken his/her head laterally (Step 302).

When the user has not shaken his/her head laterally (NO in Step 302), the controller 11 judges whether the user has nodded (Step 303). When the user has not nodded (NO in Step 303), the controller 11 judges whether a predetermined time has elapsed since an end of reproduction of the primary information of the real object 1 (Step 304). The predetermined time is, for example, about 5 to 10 seconds, as in the case of the example shown in FIG. 6.

When the predetermined time has not elapsed since the end of reproduction of the primary information of the real object 1 (NO in Step 304), the controller 11 returns to Step 302 and again judges whether the user has shaken his/her head laterally.

When the user shakes his/her head laterally before the predetermined time elapses since the end of reproduction of the primary information (YES in Step 302), the controller 11 ends the processing without reproducing secondary information of the real object 1. As a result, the user can cancel the reproduction of the secondary information by shaking his/her head laterally after the primary information is reproduced.

When the user nods before the predetermined time elapses since the end of reproduction of the primary information (YES in Step 303), the controller 11 controls sound signals such that the secondary information of the real object 1 is heard from the headphones 20. In this case, the controller 11 controls the output of sound signals such that the secondary information of the real object 1 is heard from a direction corresponding to the direction of the face with respect to the real object 1 at a volume corresponding to the distance between the real object 1 and the user. As a result, the user can start the reproduction of the secondary information by nodding after the primary information is reproduced. The controller 11 may stop the reproduction of the secondary information when the user shakes his/her head laterally while the secondary information is being reproduced.

When the predetermined time has elapsed without the user shaking his/her head laterally or nodding (YES in Step 304), the controller 11 ends the processing without reproducing the secondary information. As a result, the user can cancel the reproduction of the secondary information by not carrying out any action with respect to the primary information.

Next, the processing of the controller 11 that is carried out when secondary information is heard from the headphones 20 according to a user action with respect to primary information will be described using still another example.

FIG. 8 is a flowchart showing still another example of the processing of the controller 11 that is carried out when secondary information is heard from the headphones 20 according to a user action with respect to primary information.

As shown in FIG. 8, upon controlling sound signals such that primary information of a real object 1 is heard from the headphones 20 (Step 401) (see Step 108), the controller 11 next starts reproduction of secondary information of the real object 1 (Step 402). In this case, the controller 11 controls the output of sound signals such that the secondary information of the real object 1 is heard from a direction corresponding to the direction of the face with respect to the real object 1 at a volume corresponding to the distance between the real object 1 and the user.

Next, the controller 11 judges whether the direction of the face of the user with respect to the real object 1 has changed (Step 403). In this case, the controller 11 judges whether the direction of the face of the user has changed based on the information on a direction from the direction sensor.

When the user turns his/her head to change the direction of the face (YES in Step 403), the controller 11 adjusts a position of a sound source of the real object 1 and the volume of the secondary information according to the direction of the face of the user with respect to the real object 1. At this time, the controller 11 controls the output of sound signals such that the volume of the secondary information becomes larger as the direction of the face of the user approaches the direction from which the primary information is heard (direction from which the secondary information is heard).

When the direction of the face of the user has not changed (NO in Step 403), or when the position of the sound source of the real object 1 and the volume of the secondary information have been adjusted (Step 404), the controller 11 advances to Step 405. In Step 405, the controller 11 judges whether all of the secondary information has been reproduced.

When there is still secondary information to be reproduced (NO in Step 405), the controller 11 returns to Step 403 and again judges whether the direction of the face has changed. On the other hand, when all of the secondary information has been reproduced (YES in Step 405), the controller 11 ends the processing.

By the processing shown in FIG. 8, the user can turn up the volume of the secondary information by facing the direction from which the secondary information is heard (direction in which the real object 1 is present).

In the descriptions on the first embodiment, although the cellular phone 10 has been taken as an example of the sound control apparatus body 10, the sound control apparatus body 10 is not limited thereto. The sound control apparatus body 10 may be a portable music player, a PDA (Personal Digital Assistant), a tablet PC (Personal Computer), a laptop PC, or the like.

Second Embodiment

Next, a second embodiment of the present disclosure will be described. In the first embodiment above, the case where primary and secondary information on a real object 1 such as a shop and a building present in space are heard from a direction corresponding to a direction of a face of a user with respect to the real object 1 has been described. On the other hand, the second embodiment is different from the first embodiment above in that a virtual object 2 such as an application program is virtually arranged in space and primary and secondary information on the virtual object 2 are heard from a direction in which the virtual object 2 is provided. Therefore, that point will mainly be described.

[Overall Structure of Sound Control Apparatus 200 and Structures of Components]

FIG. 9 is a diagram showing a sound control apparatus 200 according to a second embodiment of the present disclosure. FIG. 10 is a block diagram showing an electrical structure of the sound control apparatus 200 according to the second embodiment.

As shown in the figures, the sound control apparatus 200 of the second embodiment includes a sound control apparatus body 50 (display-unit-integrated PC 50), headphones 20 connected to the sound control apparatus body 50, and a sensor unit 60. The sensor unit 60 includes a first sensor unit 61 provided on the sound control apparatus body 50 and a second sensor unit 62 provided on the headphones 20. In the second embodiment, the display-unit-integrated PC 50 is taken as an example of the sound control apparatus body 50.

The display-unit-integrated PC 50 includes a controller 51, a display unit 52, an input unit 53, a communication unit 54, and a storage 55.

The display unit 52 is constituted of, for example, a liquid crystal display or an EL (Electro-Luminescence) display. The input unit 53 is constituted of a keyboard, a mouse, a touch sensor, or the like and inputs instructions from a user and outputs them to the controller 51.

The storage 55 includes a volatile memory (e.g., RAM (Random Access Memory)) and a nonvolatile memory (e.g., ROM (Read Only Memory)). The storage 55 also includes an auxiliary storage apparatus such as an HDD (Hard Disk Drive) or an SSD (Solid State Drive). The ROM, the HDD, and the SSD fixedly store various programs and various types of data requisite for processing of the controller 51. The programs may be read out from a removable recording medium (i.e., a non-transitory, computer-readable storage medium) such as an optical disc and a semiconductor memory.

The controller 51 executes various operations based on the programs stored in the storage 55. The communication unit 54 transmits and receives various types of data such as an email and web data under control of the controller 51.

The first sensor unit 61 constituting a part of the sensor unit 60 is constituted of an image pickup device such as a CCD (Charge Coupled Device) sensor and a CMOS (Complementary Metal Oxide Semiconductor) sensor. Signals output from the first sensor unit 61 are A/D-converted and input to the controller 51. The controller 51 judges a direction of a face of a user based on an image taken by the first sensor unit 61.

The second sensor unit 62 constituting a part of the sensor unit 60 is constituted of a motion sensor such as an acceleration sensor and an angular velocity sensor. Signals output from the second sensor unit 62 are A/D-converted and input to the controller 51. The controller 51 judges a movement of a head of a user based on the signals output from the second sensor unit 62.

Although the display-unit-integrated PC 50 and the headphones 20 are connected by wires in FIGS. 9 and 10, the display-unit-integrated PC 50 and the headphones 20 may communicate wirelessly.

[Descriptions on Operations]

Next, processing of the controller 51 of the display-unit-integrated PC 50 of this embodiment will be described. FIG. 11 is a flowchart showing the processing of the controller 51 of the display-unit-integrated PC 50 of this embodiment. FIG. 12 is a complementary diagram for explaining the processing shown in FIG. 11, the diagram showing virtual objects 2 virtually arranged (i.e., positioned) in space.

As shown in FIG. 11, the controller 51 first virtually arranges virtual objects 2 in space (Step 501). FIG. 12 shows an example of a case where a mail program 2 a as the virtual object 2 is arranged on the right-hand side of a user (right-hand side of the headphones 20) and a mini blog 2 b as the virtual object 2 is arranged on the left-hand side of the user (left-hand side of the headphones 20).

As shown in FIG. 12, when the virtual objects 2 are arranged in space, the controller 51 is executing a main task such as music reproduction and video reproduction. At this time, the controller 51 controls sound signals of the headphones 20 such that audio of music and video are heard from a direction corresponding to a direction of a face of the user.

After arranging the virtual objects 2 in space, the controller 51 judges whether a mail of the mail program 2 a, tweets and comments of the mini blog 2 b, and the like have been received via the communication unit 54 (Step 502).

When those pieces of information are received via the communication unit 54 (YES in Step 502), the controller 51 next calculates an angle of a direction of the face of the user with respect to the virtual object 2 (Step 503). In this case, the controller 51 judges the direction of the face of the user with respect to the screen from an image taken by the first sensor unit 61 (image pickup device). Then, the controller 51 judges the direction of the face of the user with respect to the virtual object 2 based on the direction of the face of the user with respect to the screen.

Next, the controller 51 calculates a distance and direction for arranging a sound source of the virtual object 2 (Step 504). The direction for arranging the sound source of the virtual object 2 is determined based on the direction of the face of the user with respect to the virtual object 2.

Subsequently, the controller 51 controls sound signals such that primary information on the virtual object 2 is heard from the headphones 20 from the position of the sound source of the virtual object 2 (Step 505). It should be noted that in this case, the controller 51 synthesizes audio of the primary information with audio of music, video, and the like (audio of the main task) and outputs it. The controller 51 may execute processing of turning down the volume of the main task during reproduction of the primary information.
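The mixing in Step 505 could be as simple as summing the spatialized notification samples with attenuated main-task samples, as in this sketch using NumPy; the array names, ducking factor, and clipping strategy are illustrative assumptions, not the disclosure's method:

```python
import numpy as np

def mix_with_ducking(main_task, notification, duck_gain=0.3):
    """Mix a notification cue into the main-task audio while ducking the main task.

    Both arguments are float arrays of shape (num_samples, 2) with values in [-1, 1].
    """
    length = max(len(main_task), len(notification))
    out = np.zeros((length, 2), dtype=np.float32)
    out[: len(main_task)] += main_task * duck_gain   # turned-down music/video
    out[: len(notification)] += notification         # primary information cue
    return np.clip(out, -1.0, 1.0)                   # guard against clipping

# Example: 1 second of quiet noise as "music", 0.5 second tone burst as a notification
sr = 44_100
music = 0.2 * np.random.uniform(-1, 1, (sr, 2)).astype(np.float32)
tone = 0.5 * np.sin(2 * np.pi * 880 * np.arange(sr // 2) / sr).astype(np.float32)
mixed = mix_with_ducking(music, np.column_stack([tone, tone]))
print(mixed.shape)  # (44100, 2)
```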

For example, a case where the virtual object 2 is the mail program 2 a will be described. In this case, sound signals are controlled such that simple information such as “mail received from . . . ” is heard as the primary information of the virtual object 2 from a direction in which the mail program 2 a is arranged. Alternatively, sound signals may be controlled such that a simple alert sound such as “beep” is heard as the primary information from the direction in which the mail program 2 a is arranged.

Next, a case where the virtual object 2 is the mini blog 2 b will be described. In this case, sound signals are controlled such that simple information such as “tweet from . . . ” or “comment from . . . ” is heard as the primary information of the virtual object 2 from a direction in which the mini blog 2 b is arranged. Alternatively, sound signals may be controlled such that a simple alert sound such as “beep” is heard as the primary information from the direction in which the mini blog 2 b is arranged.

It should be noted that when the user turns his/her head to change the direction of the face while the primary information of the virtual object 2 is being reproduced, the position of the sound source of the virtual object 2 is adjusted such that the primary information is heard from a direction in which the virtual object 2 is present. For example, when the user faces the left-hand direction at a time the primary information of the virtual object 2 is heard from the left-hand side, the primary information of the virtual object 2 is then heard from the front direction.

Next, the controller 51 controls sound signals such that secondary information (specific information) is heard from the headphones 20 according to a user action with respect to the primary information (simple information) (Step 506).

<Processing of Controller 51 when Secondary Information is Heard from Headphones 20 According to User Action with Respect to Primary Information>

Next, processing of the controller 51 that is carried out when secondary information is heard from the headphones 20 according to a user action with respect to primary information will be described. The processing in this case is typically the same as the three examples of the first embodiment described with reference to FIGS. 6 to 8, except that the object arranged in space is not the real object 1 but the virtual object 2. Therefore, the processing in this case will be described with reference to FIGS. 6 to 8.

Referring to FIG. 6, when the user faces the direction from which the primary information is heard (YES in Step 202) and shakes his/her head laterally (YES in Step 204) before a predetermined time elapses since the end of reproduction of the primary information (NO in Step 203), the controller 51 ends the processing without reproducing the secondary information. As a result, the user can cancel the reproduction of the secondary information by facing the direction from which the primary information is heard and shaking his/her head laterally.

Whether the user is facing the direction from which the primary information is heard is judged based on the direction of the face of the user that has been judged based on an image taken by the first sensor unit 61 (image pickup device). Further, whether the user has shaken his/her head laterally is judged based on a signal detected by the second sensor unit 62 (motion sensor).

For example, when the user faces, at a time primary information of “mail received from . . . ” is heard, a direction from which the primary information is heard (direction in which the mail program 2 a is present) and shakes his/her head laterally, reproduction of secondary information (specific information) is canceled. Similarly, when the user faces, at a time primary information of “tweet from . . . ” or “comment from . . . ” is heard, a direction from which the primary information is heard (direction in which the mini blog 2 b is present) and shakes his/her head laterally, reproduction of secondary information (specific information) is canceled.

When the user faces the direction from which the primary information is heard (YES in Step 202) and nods (YES in Step 205) before a predetermined time elapses since the end of reproduction of the primary information (NO in Step 203), the controller 51 controls the output of sound signals such that secondary information is heard from a direction corresponding to the direction of the face of the user with respect to the virtual object 2 (Step 206). Whether the user has nodded is judged based on a signal detected by the second sensor unit 62 (motion sensor).

It should be noted that in this case, the controller 51 synthesizes audio of the secondary information with audio of music, video, and the like (audio of the main task) and outputs it. The controller 51 may execute processing of turning down the volume of the main task during reproduction of the secondary information.

For example, when the user faces, at a time primary information of “mail received from . . . ” is heard, a direction from which the primary information is heard (direction in which the mail program 2 a is present) and nods, reproduction of secondary information (specific information) is started. In this case, the controller 51 reproduces a mail content as the secondary information.

Similarly, when the user faces, at a time primary information of “tweet from . . . ” or “comment from . . . ” is heard, a direction from which the primary information is heard (direction in which the mini blog 2 b is present) and nods, reproduction of the secondary information (specific information) is started. In this case, the controller 51 reproduces a content of the tweet or comment as the secondary information. A TTS (Text To Speech) technique is used as a method of converting a text content of mails, tweets, comments, and the like into audio.
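As one possible way to realize the text-to-speech step, the sketch below uses the third-party pyttsx3 package purely as an example; the disclosure does not name a specific TTS engine, and the message text is hypothetical:

```python
import pyttsx3  # offline text-to-speech engine, used here only as an example

def speak_secondary_information(text, rate_wpm=170):
    """Convert the text content of a mail, tweet, or comment into audible speech."""
    engine = pyttsx3.init()
    engine.setProperty("rate", rate_wpm)  # speaking rate in words per minute
    engine.say(text)
    engine.runAndWait()                   # blocks until playback finishes

speak_secondary_information("Mail received: the meeting has been moved to 3 p.m.")
```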

As described above, the user can grasp secondary information as a content more specific than the primary information by facing the direction from which the primary information is heard (direction in which the virtual object 2 is present) and nodding.

It should be noted that when the user turns his/her head and changes the direction of the face while the secondary information of the virtual object 2 is being reproduced, the position of the sound source is adjusted such that the secondary information is heard from a direction in which the virtual object 2 is present. For example, when the user faces the direction of the virtual object 2, nods, and then turns his/her head leftward by 90°, the secondary information is heard from the right-hand direction.

Further, reproduction of the secondary information may be stopped as the user shakes his/her head laterally while the secondary information is being reproduced.

When the user faces the direction from which the primary information is heard and a predetermined time elapses without the user shaking his/her head laterally or nodding (YES in Step 203), the controller 51 ends the processing without reproducing the secondary information. In other words, when the user does not carry out any action with respect to the primary information, the controller 51 ends the processing without reproducing the secondary information. As a result, the user can cancel the reproduction of the secondary information by not carrying out any action with respect to the primary information.

Referring to FIG. 7, when the user shakes his/her head laterally (YES in Step 302) before a predetermined time elapses since the end of reproduction of the primary information (NO in Step 304), the controller 51 ends the processing without reproducing the secondary information such as a content of a mail, tweet, and comment. As a result, the user can cancel the reproduction of the secondary information by shaking his/her head laterally (irrespective of the face direction) after the primary information is reproduced.

When the user nods (YES in Step 303) before a predetermined time elapses (NO in Step 304), the controller 51 controls sound signals such that the secondary information such as a content of a mail, tweet, and comment is heard from the headphones 20 (Step 305). In this case, the controller 51 controls the output of sound signals such that the secondary information is heard from a direction corresponding to the direction of the face with respect to the virtual object 2.

By the processing as described above, the user can start reproduction of the secondary information by nodding (irrespective of the face direction) after the primary information is reproduced. The controller 51 may stop the reproduction of the secondary information when the user shakes his/her head laterally while the secondary information is being reproduced.

When the predetermined time has elapsed without the user shaking his/her head laterally or nodding (YES in Step 304), the controller 51 ends the processing without reproducing the secondary information. As a result, the user can cancel the reproduction of the secondary information by not carrying out any action with respect to the primary information.

Referring to FIG. 8, when the user faces the direction from which theprimary information is heard (direction from which secondary informationis heard) after reproduction of the secondary information is started(YES in Step 403), the secondary information is heard from a directioncorresponding to the direction of the face of the user with respect tothe virtual object 2, and the volume of the secondary informationbecomes larger as the direction of the face approaches the direction ofthe virtual object 2 (Step 404).

FIG. 13 shows a state where the user faces, after listening to primaryinformation on the mail program 2 a, a direction from which the primaryinformation is heard. As shown on the left-hand side of FIG. 13, whenaudio of a main task (audio of music, etc.) is heard from the frontside, primary information of “mail received from . . . ” is heard from aposition of a sound source of the mail program 2 a arranged on theright-hand side of the user. Next, a case where the user turns his/herhead to face a direction from which the primary information is heard asshown on the right-hand side of FIG. 13 will be described.

In this case, sound signals are controlled such that the secondary information such as a mail content is heard from a direction corresponding to the direction of the face of the user with respect to the mail program 2a. Similarly, sound signals are controlled such that audio of a main task (audio of music) is heard from a direction corresponding to the direction of the face of the user. Moreover, in this case, sound signals are controlled such that the volume of the secondary information becomes larger and the volume of the main task becomes smaller as the direction of the face approaches the direction from which the primary information is heard (the direction from which the secondary information is heard).

By the processing shown in FIG. 8, the user can turn up the volume of the secondary information by facing the direction from which the secondary information is heard (the direction in which the virtual object 2 is present).
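The volume behavior of Step 404 can be illustrated with a simple angular mapping. The sketch below is an assumption-laden Python example: the linear crossfade, the 90° fade range, and the yaw convention are chosen here for illustration and are not specified by the embodiment.

```python
# As the face direction approaches the direction of the virtual object,
# the secondary information gets louder and the main task gets quieter.

def crossfade_volumes(face_yaw_deg, object_yaw_deg, max_angle_deg=90.0):
    """Return (secondary_volume, main_task_volume), each in 0.0..1.0.

    face_yaw_deg and object_yaw_deg are directions measured from the same
    fixed reference (e.g. the user's initial front direction).
    """
    # smallest angular difference between face direction and object direction
    diff = abs((face_yaw_deg - object_yaw_deg + 180.0) % 360.0 - 180.0)
    closeness = max(0.0, 1.0 - diff / max_angle_deg)   # 1.0 when facing the object
    secondary_volume = closeness
    main_task_volume = 1.0 - closeness
    return secondary_volume, main_task_volume


# Example: the virtual object sits 90 degrees to the user's right.
print(crossfade_volumes(face_yaw_deg=0.0, object_yaw_deg=90.0))    # (0.0, 1.0)
print(crossfade_volumes(face_yaw_deg=45.0, object_yaw_deg=90.0))   # (0.5, 0.5)
print(crossfade_volumes(face_yaw_deg=90.0, object_yaw_deg=90.0))   # (1.0, 0.0)
```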

Modified Example of Second Embodiment

Next, a modified example of the second embodiment will be described. As described above, in the second embodiment, application programs such as the mail program 2a and the mini blog 2b are arranged in space as the virtual objects 2. Then, the controller 51 controls sound signals such that primary and secondary information are heard from the directions of the virtual objects 2. Since the primary and secondary information are heard from certain directions, the user already knows the directions in which the application programs such as the mail program 2a and the mini blog 2b (virtual objects 2) are arranged. Such a relationship is used in the modified example of the second embodiment.

FIG. 14 is a flowchart showing processing according to the modified example of the second embodiment. FIG. 15 is a complementary diagram for explaining the processing shown in FIG. 14, the diagram showing a state where the user utters a voice while facing a direction in which a virtual object 2 (mail program 2a) is present.

As shown in FIG. 15, the controller 51 is executing a main task such as music reproduction and video reproduction. Referring to FIG. 14, the controller 51 judges whether the direction of the face of the user has changed based on an image taken by the first sensor unit 61 (image pickup device) (Step 601).

When the direction of the face of the user has changed (YES in Step 601), the controller 51 judges whether an application program such as the mail program 2a and the mini blog 2b (virtual object 2) is arranged in the direction in which the user is facing (Step 602). In this case, the controller 51 judges that there is an application program (virtual object 2) in the direction in which the user is facing when there is an application program within the range of, for example, ±45° from the front direction of the face of the user.
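A minimal sketch of the Step 601/602 judgment is given below, assuming the face direction and the arrangement of the virtual objects 2 are both expressed as yaw angles around the user. The data layout, the helper names, and the handling of the ±45° threshold are illustrative assumptions.

```python
def angular_difference(a_deg, b_deg):
    """Smallest absolute difference between two directions, in degrees."""
    return abs((a_deg - b_deg + 180.0) % 360.0 - 180.0)


def find_facing_object(face_yaw_deg, objects, threshold_deg=45.0):
    """Return the name of a virtual object within threshold_deg of the face.

    objects is assumed to map an object name (e.g. "mail_program_2a") to the
    direction, in degrees, at which it is arranged around the user.
    """
    for name, object_yaw_deg in objects.items():
        if angular_difference(face_yaw_deg, object_yaw_deg) <= threshold_deg:
            return name
    return None


# Example arrangement: mail program on the right, mini blog on the left.
virtual_objects = {"mail_program_2a": 90.0, "mini_blog_2b": -90.0}
print(find_facing_object(70.0, virtual_objects))   # "mail_program_2a"
print(find_facing_object(0.0, virtual_objects))    # None (facing the main task)
```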

It should be noted that when the direction of the face of the user has changed, the controller 51 controls the position of the sound source such that audio of music and video (audio of the main task) is heard from a direction corresponding to the direction of the face of the user (see the left- and right-hand sides of FIG. 15).
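Keeping a sound source fixed in space while the head turns amounts to re-expressing the source direction relative to the head. The one-line sketch below assumes yaw angles measured from the same fixed reference; the function name is hypothetical.

```python
def source_direction_relative_to_head(source_yaw_deg, face_yaw_deg):
    """Direction of the source with respect to the head, in (-180, 180]."""
    rel = (source_yaw_deg - face_yaw_deg) % 360.0
    return rel - 360.0 if rel > 180.0 else rel


# Main-task audio initially in front (0 degrees); the user turns 90 degrees
# to the right, so the audio should now be rendered from 90 degrees to the left.
print(source_direction_relative_to_head(0.0, 90.0))   # -90.0
```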

When it is judged YES in Step 602 (YES in Step 602), the controller 51 executes audio recognition processing (Step 603). Voices uttered by the user are collected by a microphone attached to the headphones 20, for example.

The right-hand side of FIG. 15 shows an example of a case where the user utters “Dear . . . ”, that is, the content of a mail to be transmitted to . . . , while facing the direction in which the mail program 2a (virtual object 2) is present. It should be noted that since the primary information (“mail received from . . . ”) and the secondary information (mail content) are heard from the right-hand side, the user already knows that the mail program 2a is arranged on the right-hand side of the user.

After executing the audio recognition processing, the controller 51 issues an audio command to the application program. As a result, the content of “Dear . . . ” uttered by the user is written into the mail text.

Further, when the user utters a content to be written in the mini blog 2b (tweet, comment, etc.) while facing the direction in which the mini blog 2b is present, the controller 51 recognizes the audio. Then, the content uttered by the user with respect to the mini blog 2b is written in the mini blog 2b.
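The dispatch from recognized speech to the application arranged in the face direction can be sketched as follows. The classes, the handle_voice_command() interface, and the stand-in recognizer are assumptions made for illustration; no particular speech-recognition engine is implied by the embodiment.

```python
class MailProgram:
    def __init__(self):
        self.mail_text = ""

    def handle_voice_command(self, text):
        self.mail_text += text          # recognized speech becomes the mail body


class MiniBlog:
    def __init__(self):
        self.posts = []

    def handle_voice_command(self, text):
        self.posts.append(text)         # recognized speech becomes a tweet/comment


def dispatch_utterance(facing_object, recognize_speech, audio_frames):
    """Recognize the utterance and route it to the facing application."""
    if facing_object is None:
        return                          # no application in the face direction
    text = recognize_speech(audio_frames)   # assumed speech-to-text callable
    facing_object.handle_voice_command(text)


# Usage with stand-in recognizers:
mail = MailProgram()
dispatch_utterance(mail, recognize_speech=lambda frames: "Dear ...", audio_frames=None)
print(mail.mail_text)                   # "Dear ..."
```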

By the processing as described above, the user can write contents of a mail and mini blog by utterance without inhibiting the main task.

Although the display-unit-integrated PC 50 has been taken as an example of the sound control apparatus body 50 in the descriptions on the second embodiment and the modified example of the second embodiment, the sound control apparatus body 50 is not limited thereto. The sound control apparatus body 50 may be a desktop PC, a laptop PC, a tablet PC, a cellular phone, a portable music player, a PDA (Personal Digital Assistant), or the like.

Modified Examples

The descriptions above have taken the headphones 20 as an example. However, it is also possible to use earphones (sound output unit) in place of the headphones 20.

In the descriptions above, the direction sensor, the motion sensor, and the like provided in the headphones 20 have been taken as examples of sensors that detect user actions with respect to primary information. However, the sensor for detecting a user action with respect to primary information is not limited thereto.

For example, a microphone may be used as the sensor for detecting a user action with respect to primary information. The microphone is provided in the sound control apparatus body 10 or 50 or the headphones 20, for example.

In this case, for example, when primary information such as “there is a shop” and “mail received” is heard, the user utters a voice such as “yes” or “no” to the microphone as an action with respect to the primary information. When judged that the voice of “yes” has been uttered, the controller 11 or 51 may output, as sound signals, secondary information such as specific information on a shop and a mail content. The user may utter “yes” or “no” while facing a direction in which the object such as a shop and the mail program 2a is present, or may utter “yes” or “no” regardless of the direction in which the object such as a shop and the mail program 2a is present.
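As a minimal sketch of this microphone variant, assuming a stand-in speech recognizer and a play_secondary() callback (both hypothetical names), the logic reduces to matching the recognized word:

```python
def handle_voice_response(recognize_speech, audio_frames, play_secondary):
    """Return "played", "cancelled", or "ignored" based on the utterance."""
    word = recognize_speech(audio_frames).strip().lower()
    if word == "yes":
        play_secondary()                # e.g. shop details or the mail content
        return "played"
    if word == "no":
        return "cancelled"
    return "ignored"                    # any other utterance: no action taken


# Usage with a stand-in recognizer:
result = handle_voice_response(lambda frames: "Yes", None,
                               lambda: print("secondary information"))
print(result)   # "played"
```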

The present disclosure can also take the following structure.

(1) An apparatus, comprising:

- a hardware processor; and
- a storage medium coupled to the processor and storing instructions that, when executed by the processor, cause the apparatus to:
  - receive a direction signal indicating a first direction in which a head of a user is facing with respect to a fixed direction;
  - convert the first direction in which the head is facing with respect to the fixed direction into a second direction in which the head is facing with respect to a third direction, connecting the head and an object; and
  - generate an audio signal having characteristics such that a sound generated from the audio signal is perceived by the user as arriving from the third direction.

(2) The apparatus of (1), comprising a direction sensor configured to generate the direction signal.

(3) The apparatus of (1) or (2), wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal indicates first information about the object.

(4) The apparatus of (3), wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to:
- receive a movement signal indicating a movement of the head; and
- selectively generate the audio signal such that the sound generated from the audio signal indicates second information about the object, based on the movement signal.

(5) The apparatus of (4), wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal indicates the second information when the movement signal indicates the head is nodded.

(6) The apparatus of (4), wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal indicates the second information when (i) the movement signal indicates the head is nodded and (ii) the direction signal indicates the head is facing the object.

(7) The apparatus of any one of (4) to (6), comprising a movement sensor configured to generate the movement signal.

(8) The apparatus of any one of (4) to (7), comprising an antenna for receiving radio waves, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to receive the first and second information from another apparatus via the antenna.

(9) The apparatus of any one of (1) to (8), wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to:
- receive a head position signal indicating a position of the head;
- receive an object position signal indicating a position of the object; and
- selectively generate the audio signal when a distance between the position of the head and the position of the object is less than or equal to a predetermined threshold distance.

(10) The apparatus of (9), wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that a volume of the sound generated from the audio signal corresponds to the distance between the position of the head and the position of the object.

(11) The apparatus of (9) or (10), comprising an antenna for receiving radio waves, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to receive the object position signal from another apparatus via the antenna.

(12) The apparatus of any one of (1) to (11), wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal has a higher volume when the head faces the object than when the head does not face the object.

(13) The apparatus of any one of (1) to (12), comprising one of headphones or earphones configured to receive the audio signal and generate the sound from the audio signal.

(14) The apparatus of any one of (1) to (13), wherein:
- the object is a virtual object; and
- the storage medium stores instructions that, when executed by the processor, cause the apparatus to position the virtual object in space.

Although some embodiments have been described in detail with reference to the accompanying drawings, the present disclosure is not limited to such embodiments. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations, and alterations may occur depending on design requirements and other factors insofar as they are within the scope of the appended claims or the equivalents thereof. Further, it should be understood that, as used herein, the indefinite articles “a” and “an” mean “one or more” in open-ended claims containing the transitional phrase “comprising,” “including,” and/or “having.”


What is claimed is:
1. An apparatus, comprising: a hardware processor; and a storage medium coupled to the processor and storing instructions that, when executed by the processor, cause the apparatus to: receive a direction signal indicating a first direction in which a head of a user is facing with respect to a fixed direction; convert the first direction in which the head is facing with respect to the fixed direction into a second direction in which the head is facing with respect to a third direction, connecting the head and an object; and generate an audio signal having characteristics such that a sound generated from the audio signal is perceived by the user as arriving from the third direction, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal indicates first information about the object, and wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to: receive a movement signal indicating a movement of the head; and selectively generate the audio signal such that the sound generated from the audio signal indicates second information about the object based on the movement signal, wherein content of the second information supplements the first information about the object.
2. The apparatus of claim 1, comprising a direction sensor configured to generate the direction signal.
3. The apparatus of claim 1, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal indicates the second information when the movement signal indicates the head is nodded.
4. The apparatus of claim 1, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal indicates the second information when (i) the movement signal indicates the head is nodded and (ii) the direction signal indicates the head is facing the object.
5. The apparatus of claim 1, comprising a movement sensor configured to generate the movement signal.
6. The apparatus of claim 1, comprising an antenna for receiving radio waves, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to receive the first and second information from another apparatus via the antenna.
7. The apparatus of claim 1, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to: receive a head position signal indicating a position of the head; receive an object position signal indicating a position of the object; and selectively generate the audio signal when a distance between the position of the head and the position of the object is less than or equal to a predetermined threshold distance.
8. The apparatus of claim 7, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that a volume of the sound generated from the audio signal corresponds to the distance between the position of the head and the position of the object.
9. The apparatus of claim 7, comprising an antenna for receiving radio waves, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to receive the object position signal from another apparatus via the antenna.
10. The apparatus of claim 1, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal has a higher volume when the head faces the object than when the head does not face the object.
11. The apparatus of claim 1, comprising one of headphones or earphones configured to receive the audio signal and generate the sound from the audio signal.
12. The apparatus of claim 1, wherein: the object is a virtual object; and the storage medium stores instructions that, when executed by the processor, cause the apparatus to position the virtual object in space.
13. A method, comprising: receiving a direction signal indicating a first direction in which a head of a user is facing with respect to a fixed direction; converting the first direction in which the head is facing with respect to the fixed direction into a second direction in which the head is facing with respect to a third direction, connecting the head and an object; and generating an audio signal having characteristics such that a sound generated from the audio signal is perceived by the user as arriving from the third direction, wherein the generating the audio signal is such that the sound generated from the audio signal indicates first information about the object; receiving a movement signal indicating a movement of the head; and generating the audio signal such that the sound generated from the audio signal indicates second information about the object based on the movement signal, wherein content of the second information supplements the first information about the object.
14. A non-transitory, computer-readable storage medium storing instructions that, when executed by a processor, cause an apparatus to: receive a direction signal indicating a first direction in which a head of a user is facing with respect to a fixed direction; convert the first direction in which the head is facing with respect to the fixed direction into a second direction in which the head is facing with respect to a third direction, connecting the head and an object; and generate an audio signal having characteristics such that a sound generated from the audio signal is perceived by the user as arriving from the third direction, wherein the storage medium stores instructions that, when executed by the processor, cause the apparatus to generate the audio signal such that the sound generated from the audio signal indicates first information about the object, and wherein the storage medium stores instructions that, when executed by the processor, cause: reception of a movement signal indicating a movement of the head; and selective generation of the audio signal such that the sound generated from the audio signal indicates second information about the object based on the movement signal, wherein content of the second information supplements the first information about the object.